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Investigating the performance of different methods is a fundamental problem in graph partitioning. 
In this paper, we estimate the so-called detectability threshold for the spectral method with both 
unnormalized and normalized Laplacians in sparse graphs. The detectability threshold is the critical 
point at which the result of the spectral method is completely uncorrelated to the planted partition. 
We also analyze whether the localization of eigenvectors affects the partitioning performance in the 
detectable region. We use the replica method, which is often used in the field of spin-glass theory, 
and focus on the case of bisection. We show that the gap between the estimated threshold for 
the spectral method and the threshold obtained from Bayesian inference is considerable in sparse 
graphs, even without eigenvector localization. This gap closes in a dense limit. 


I. INTRODUCTION 

Over recent decades, significant attention has been paid to the clustering problem on graphs or networks [l|. 
Although clustering is sometimes considered as simply an optimization problem, e.g., finding the most efficient par¬ 
titioning in parallel computing, it can also be used to find physically meaningful modules, or communities, among 
which each vertex or edge shares a common attribute. The latter technique is usually called community detection, 
and is often formulated as a discrete optimization problem. Many methods and algorithms have been developed 
for clustering graphs pj , and their performance has been investigated both theoretically [Ml and experimentally 

[mu. 

One important theoretical problem concerns the so-called detectability threshold Suppose that we apply a 

clustering method to a set of random graphs produced by a generative model with a planted block structure, and 
examine the degree of correlation between the planted partition and the partition obtained by the method. The 
standard generative model is called the stochastic block model or planted partition model. As the connection between 
modules gets stronger, i.e., as the block structure gets weaker, the partition given by the clustering method will 
be less correlated with the planted partition. The detectability threshold is the critical point of a parameter that 
indicates the strength of the block structure. Below this threshold, the partition obtained by the method is completely 
uncorrelated to the planted partition. This transition is especially problematic in sparse graphs. For dense graphs, 
it has been shown that such a transition does not exist and we are always able infer the planted structure in the 
large size limit [2~il f2Gl | : as we increase the total number of nodes, the average degree also increases in dense graphs, 
and therefore we obtain more information about the module assignment of each node. Unfortunately, this is not the 
case for sparse graphs because the average degree remains constant independent of the graph size. Needless to say, 
the detectability threshold in sparse graphs is a significant problem, as many real networks are sparse. Note that, 
although we specify the algorithm, i.e., the spectral method, the existence of the detectability threshold itself is not 
an algorithmic problem, but the theoretical limit of an objective function. 

Although there are many levels of graph clustering, e.g., hierarchical clustering and clustering with overlaps, we focus 
on undirected graphs without hierarchical structures and consider the graph partitioning problem, i.e., partitioning 
of a graph into non overlapping modules. In addition, we focus on the case of bisection in the large-size limit. 

As mentioned above, the planted partition is given in the analysis of the detectability threshold. In practice, 
however, not only do we not know the model parameters a priori (such as the module sizes and the fraction of edges 
between modules), we are not even sure whether it is appropriate to assume that the graph was generated by the 
same mechanism as the stochastic block model in the first place. Therefore, knowing the detectability of a method 
does not readily contribute to its practical use. It does, however, provide an important clue for categorizing the 
detection method: It reveals the similarity of outcomes between methods through the stochastic block model. Thus, 
for this purpose, the stochastic block model should be regarded as a model that gives a measure for comparison. 
For example, although block structures exist in many senses [l|, applying all the existing methods to the given data 
obviously entails a huge cost and is also redundant. If we know which methods tend to give a similar partitioning, we 
could reduce the cost significantly. Whereas performance comparison studies are often experimental, it is promising 
that theoretical analyses can provide deeper insights. 

In this paper, we discuss the detectability threshold of the spectral method. The spectral method can be employed 
with three major discrete objective functions: ratio-cut (RatioCut), normalized-cuts (Ncut), and modularity. The 
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spectral method solves the continuous relaxation of these objective functions as an eigenvalue/eigenvector problem 
of the corresponding matrix. The unnormalized Laplacian L corresponds to RatioCut, the normalized Laplacian C 
corresponds to Ncut, and the so-called modularity matrix B corresponds to modularity. Nadakuditi and Newman 
[§] predicted that the detectability threshold of the spectral method with the modularity function coincides with the 
threshold give n by Bayesian inference [5, 6j. However, this assumes that the average degree c is sufficiently high. It 
was argued in |27( that the estimate in [81 may not be precise for sparse graphs and that there exists a gap between 
the spectral method with modularity and Bayesian inference; indeed, it was confirmed numerically that the spectral 
method does not detect the planted partition all the way down to the detectability threshold. 

It was later discovered that the spectral method with a non-backtracking matrix was a promising means of 
filling this gap. This approach provides a formalism that avoids the emergence of localized eigenvectors, known to be 
a drawback of the spectral method. A localized eigenvector is one in which the weight of its elements is concentrated 
on a few characteristic vertices. Once the eigenvector used for partitioning becomes localized, the information of 
the block structure will be washed out. Thus, preventing this effect will enhance our ability to detect the planted 
structure. 

To the best of our knowledge, however, the true detectability threshold for the spectral method in sparse graphs 
remains unknown. Therefore, we do not know to what extent the gap actually exists. Furthermore, although the 
non-backtracking matrix approach improves the detectability by avoiding the eigenvector localization problem, the 
logical and quantitative connection to detectability seems to be incomplete. That is, it is not known whether the 
gap appears because of the localization or if it exists even when the localization is absent. When both are present, 
evaluating the relative degree of their effects is an important problem. Our results show that, in sparse graphs, a 
considerable gap exists even when localization is absent and that the effect of localization may be significant when 
the degree fluctuates considerably. 

Using the so-called replica method , which is often used in spin-glass theory, we derive estimates for the detectability 
threshold of the spectral method with both unnormalized and normalized Laplacians. Note that, as pointed out 
in [Til , the spectral method with the normalized Laplacian C and the modularity matrix B are equivalent for the 
bisection problem, as long as the continuous relaxation gives a good estimate of the original discrete problem. We 
compare our analytical estimates with the results of numerical experiments for the two-block random graphs with 
uniform, bimodal, and Poisson degree distributions. Although our estimates contain some approximations, they agree 
quite well with the results of the numerical experiment, as long as localization does not occur. For the analysis of 
localized eigenvectors, we show that our estimate is fairly accurate for graphs with bimodal degree distributions. 

The rest of this paper is organized as follows. We first introduce a more precise definition of the stochastic block 
model (Sec. m and the spectral method in graph partitioning ('Sec. lIIll) . Then we derive an estimate of the detectability 
threshold in two-block random regular graphs in Sec. IIVI Note that there is no distinction between unnormalized and 
normalized Laplacians in this case. We analyze the effect of degree fluctuation for the unnormalized Laplacian L in 
Secs. |V]and|Vl] In Sec. El we present a formal solution for estimating the detectability threshold and analyze the case 
of graphs with bimodal degree distributions, and in Sec. IVII we estimate the localized eigenvector and its eigenvalue 
for a graph with a bimodal degree distribution. A similar analysis is done for the normalized Laplacian C in Secs. IVIII 
and IVIII I For the normalized Laplacian C of graphs with an arbitrary degree distribution, the resulting estimate of 
the detectability threshold resembles that of random regular graphs. Finally, we discuss the case of stochastic block 
models, i.e., two-block random graphs with Poisson degree distributions, with the normalized Laplacian C in Sec. IIXI 
We summarize our results in Sec. E 

II. STOCHASTIC BLOCK MODEL AND THE DETECTABILITY THRESHOLD 

The stochastic block model j2l| is a generative model of random graphs with a block structure, and is commonly 
used for analyzing the performance of clustering methods. While many variants have been proposed [30l - j35| . the 
model is fundamentally a generalization of the Erdos-Renyi random graph. In the stochastic block model, the number 
of modules q , size of each module, and probability p rs that vertices in modules r and s are connected are specified as 
inputs. With these parameters, the graphs are constructed as follows. Each vertex has a preassigned module index 
(7; = r (r e {1,..., g}) to which the vertex belongs. Based on this block structure, edges are generated between pairs 
of vertices at random, i.e., vertices i € r and j 6 s are connected with probability p rs . 

In the case of sparse graphs of N vertices, we set p rs = c rs /V, where c rs remains constant in the limit N —> oo. 
To construct an assortative block structure, we typically choose p rr = c- m /N and p rs = c ou t/N for r ^ s, where c- ln 
and c ou t are constants that satisfy Ci n > c ou t- In the case of two modules of equal size, we have the average degree 
c = (cm + c ou t)/2. Note that, in this case, unlike the dense case, the fluctuation of the degree of each vertex does not 
vanish in the limit N — > oo. 
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According to [5|, | 6 j, the Bayesian inference method has a detectability threshold at 

Cm Cout — 2\/c- 


(1) 


That is, even when the generative model has the assortative property c; n > c ou t, it is impossible to retrieve that 
information with any detection algorithm, unless the difference Ci n — c ou t is greater than 2 vc (for other detectability 
analyses with sufficiently large average degree, see 0,0,0,1371). 

In the present paper, we parametrize the stochastic block model differently. As mentioned above, we restrict 
ourselves to the two-block model. Instead of setting c; n and c ou t as the model parameters, we set the average degree 
c and the average number of edges 7 from one module to the other, or, in other words, the total number of edges 
7 TV between two modules. The parameters 7 controls the strength of block structure; the larger is the value of 7, the 
weaker is the block structure. It is related to c ou t and ci n by 


cTV 

~2~ 


7 TV 



_ N 

P out — “Cout, 


7 iV = 2 (|) 



( 2 ) 

(3) 


in the limit TV —> 00. Although the total number of edges between modules fluctuates in the standard formulation 
when the graphs are finite, we let every realization have exactly 7TV and cTV /2 — 7IV for the number of edges between 
modules and the total number of edges within modules, respectively. This is called the “microcanonical” formulation 
of the stochastic block model pfij . 


III. SPECTRAL METHOD IN GRAPH BISECTION 


Graph partitioning is often formulated as a discrete optimization problem for some objective function that is 
computationally difficult. The spectral method constitutes a continuous relaxation of the original problem using 
eigenvectors of a proper matrix. The unnormalized Laplacian L, which is used for RatioCut, and the normalized 
Laplacian C, which is used for Ncut, are defined as 


L = D~ A, ( 4 ) 

£ = D~ 1/2 LD~ 1/2 . (5) 

The matrix A is the adjacency matrix, i.e., Aij = 1 if vertices i and j are connected and Aij = 0 otherwise. The 
matrix D is a diagonal matrix with degree Ci of vertex i on the diagonal element, i.e., Dij = Cidij. 

Although the details differ depending on the objective function, the basic procedure of spectral bisection is the 
same and quite simple 0, j38 : j. We denote the total degree of the graph as K. For a graph with a set of vertices V 
partitioned into V\ and V 2 , the objective function of RatioCut is defined as 


/RatioCut (Ul , I'd ) 


E(V lt V 2 ) 

TV1TV2 


( 6 ) 


where the cut size E(V i, V 2 ) is the number of edges between modules V\ and V 2 , and we denote by N\ and IV 2 the 
number of vertices in each module. Similarly, the objective function of Ncut is defined as 


/Ncut(^L, V2) 


E(VuV 2 ) 
l< \ K 2 


(7) 


where K\ and K 2 are the total degrees of each modules. Using the unnormalized Laplacian L and the normalized 
Laplacian £, minimizing RatioCut and Ncut are equivalent to 


and 


min x T Lx, 
x 


subject to Xi 


y/?h/N~i i € Vi, 

-s/NUN~2 i&V 2 , 


minx 1 ' Cx, 

X 


subject to Xi 


^K 2 /Ki ie Vi, 

-^kUk~ 2 iev 2 . 


( 8 ) 


( 9 ) 








4 



FIG. 1: (Color online) A realization of two-block 4-random regular graphs. 


Finally, allowing Xi to take an arbitrary real number, we obtain the relaxed versions of the above discrete optimization 
problems. For RatioCut, 


min x T Lx subiect to *11, x T x = TV, (10) 

and for Ncut, 

min x T Cx subiect to x _L D 1 ^ 2 1, x T x = K. (11) 

xm N 

The smallest values of x T Lx and x T Cx are achieved when the x are the eigenvectors corresponding to the smallest 
eigenvalue of L and C, respectively. Note, however, that L is positive semi-definite and 1 is the eigenvector of 
L corresponding to the zero eigenvalue (i.e., the smallest eigenvalue). Because of the constraint that x must be 
perpendicular to 1, the smallest value of x T Lx in Eq. (fTOl) is achieved when we select the eigenvector corresponding 
to the second-smallest eigenvalue of L. Similarly, C is also positive semi-definite, and D l / 2 \ is the eigenvector 
corresponding to the zero eigenvalue. Hence, the eigenvector corresponding to the second-smallest eigenvalue of C 
gives the smallest value of x T Cx in Eq. m■ As the sign of Xi indicates which module vertex i belongs to in Eqs. (| 8 |) 
and (ED, we retrieve information about the optimal partition from the solutions of the relaxed problems by referring 
to the sign of each element in the eigenvector, i.e., vertices with the same sign belong to the same module. It is known 
that this prescription works well when the module sizes are not very different. Of course, it is not obvious whether 
the optimal partition of the relaxed problem coincides with that of the unrelaxed problem. However, this is beyond 
the scope of the present paper, and we concentrate on the relaxed problem, i.e., the spectral method. 

In the following sections, we analyze how the optimal partitions in the spectral method are correlated to the planted 
partitions in various random graphs. 


IV. DETECTABILITY THRESHOLD IN RANDOM REGULAR GRAPHS 

In this section, we analyze the detectability of the c-random regular graph with a two-block structure. A realization 
of such graphs is shown in Fig. |T] As c is a constant and does not increase as a function of the graph size TV*, this 
is regarded as a sparse random graph that has the property of dense graphs that the degree fluctuation is negligible. 
This is worth investigating, because the results we show in Sec. IVIII for the normalized Laplacian C are analogous 
to those we obtain for random regular graphs in this section. Although we analyze the unnormalized Laplacian L 
here, as the degree is the same for every vertex, there is no distinction between the unnormalized and normalized 
Laplacians. 

We calculate the average of the second-smallest eigenvalue of the unnormalized Laplacian [A 2 ] L , where [. ,.] L 
represents the average with respect to L of each realization of a random graph. As we increase the fraction 7 of 
edges between modules, the value of [A 2 ] L increases until it reaches the edge of the spectral band, above which [A 2 ] L 
becomes constant irrespective of 7 . As we show in the following, by calculating [A 2 ] L , we can obtain the distribution 
of the elements in the corresponding eigenvector. 

The basic methodology here runs parallel to that in [39|. To calculate the second-smallest eigenvalue, we first 
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introduce the Hamiltonian H(x\L), partition function Z(/3\L), and free energy f(/3\L), 

H(x\L) = -x t Lx, (12) 

Z(fi\L) = J dxe-P H{xW S(\x\ 2 - N)S(1 t x), (13) 

m L ) = -J^^m L ), (14) 

where x is an IV-dimensional vector, 1 is the vector in which each element is 1, and T denotes the transpose. The 
Hamiltonian corresponds to the objective function to be minimized in Eq. (USD; the factor 1/2 in Eq. (fl2l) is purely 
conventional. The S functions in Eq. (fl3l) impose the constraints in Eq. (USD- The crucial aspect of this formulation 
is that, in the limit j3 —> oo, in conjunction with the operation of (5(1 r :c), the contribution from the second-smallest 
eigenvalue is dominant in the integral in Eq. (USD- Thus, Eq. (fill) actually evaluates exp [— N/3X2/2). Hence, the 
second-smallest eigenvalue A 2 is given by 


A 2 


2 lim f{0\L). 

p —^ OO 


(15) 


We then take the average over all realizations of random graphs. However, the direct calculation of this average is 
not tractable. Therefore, we recast [A 2 ] L as 


[A 2 ]i = —2 lim -Y [lnZ(/3\L)\ L 

p—>• OO fj 

= —2 lim lim -Y Y In [Z n (p\L)\, . 
^oon^o N/3dn L v 1 nL 


(16) 


The assessment of [i?"(/3|L)]z, is also difficult for a general real number n. However, when n is a positive integer, 
[Z n (/3\L)}L can be expressed as a high-dimensional integral with respect to n replicated variables x 1 ,x 2 ,... ,x n , 
which can be analytically evaluated by the saddle-point method as N tends to infinity. In addition, the resulting 
expression of TV -1 \n[Z n {)3 \L)\l is shown to be a function of n that can be extended to real values of n under a certain 
ansatz concerning the permutation symmetry among the replica indices a = 1,2,..., n. Therefore, we employ such an 
expression to calculate the right-hand side of Eq. m- This procedure is often termed the replica method. Although 
the mathematical validity of the replica method has not yet been proved, we see that our assessment based on the 
simplest permutation symmetry for the replica indices offers a fairly accurate prediction of the experimental results. 

For n € N, we can write [Z n (/3\L)\ L as 


[z n m)]L = 


^ dx a 6(\x a \ 2 


1 


-/V)(5(l T a; a ) 



and the exponential factor is given by 



= exp 



%ja ) “t - ^ ^ %ja ) H - 2 ^ ^ ^ ^ Wijip 

ij'ev2 ieVi jevb 



(17) 


(18) 


where Uij = { Aij\i £ r,j £ r} and Wij = {Aij\i £ V\. j £ V 2 }. For the average over the random graphs, we assume 
that each realization occurs with equal probability. The condition of being a regular graph requires 


Y. uu + 

Y w ik = c 

(for i £ Vi), 

(19) 

lev 1 

kev 2 



Y u i l + 

Y w a = c 

(for j £ V 2 ), 

(20) 

iev 2 

kev 1 




and the number of edges between modules is 

Wik = ryN - 

i&v 1 fcev 2 


( 21 ) 
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Therefore, we have 


exp ( ~2 Y X “ LXa 

\ a J 

71 E {IT 15 (Y Uii + Y Wik - c ) IT 5 (Y u o i + Y w :> k - c l 5 (y Y Wik - 


Mg 


{uij}{wij} \iEVi \ieVi kev 2 


jev 2 \iev 2 kevi 


Vi£Vi keV 2 


x exp 


. y ] | y ' UijiXia x ja) + ^ ' Uij(Xi a Xj a )~ + 2 ^ ^ ' U , ij(Xi a Xja)~ 


a=l \ijeV i 


ij &V 2 


iev i jev 2 


( 22 ) 


where Mg is the number of random regular graphs with two modules (see Appendix O for the count of Mg in random 
regular graphs). With the technique in Appendix 0 Eq. (l22l) can be written as 


exp ( Y x a LXa 


J L 




jev 2 


iev i 


i<jeV i wy={0,l} 

>< n e 

i<jeV 2 «y={0,l} 


x n n e ( z i z j exp 

ieVi jev 2 wij={o,i} V 


2 tt Zj 


vn N 


2iT 


exp 

-f E( x “ - ^a*) 2 

a 

exp 

E( X “ “ 

a 


V r, y ] { x ia x ja) 


When TV TS> 1, an element in Eq. (1231) can be approximated as 


n e 

i<jev rUij ={ 0,1} 


ZiZj exp 


-§ Y^ Xia ~ x ^ 2 


exp 


exp 


ZiZj exp 

i<j£V r 


2 E( x “ - ^“) 2 

a , 


where we have introduced the order parameter functions 


g r (M w ) = -E E 

i£V r a 


for r = 1,2. Then, inserting the identity 


1 = J dQr(^ r) )6 J2 Zi P 6{x ia - ijM) - Q r (M {r) )j 

&V 0 ) ( E ^ II - /£>) - PrNQ r (^) 


= PrN 


dQ r (^ r) )dQr{pL {r) ) 


KieV r a 


for each and replacing the S functions as 


N 


S[Y X ia~ N = 


f pd(f)a Ztjy 


47T 


AT 


ME- 




47T 


(23) 


(24) 


(25) 


(26) 

(27) 

(28) 
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we can recast Eq. 0 as 

[z n (p\L)] L = p1P2 n 2 j n 


dQ r ( P ^)dQ r ( P ^) [ pdct> a pdip a [dr, 


where 


r—1,2 


27T 


n 


47r 47r / 27T 


x exp 


N 2 Kl{Qr, Qr) + 2^00 — ^2 KlIriQr, Qr) + ^ In KlII,r(Qr, {<f>a}, {^a}) 

' a r=l ,2 r=l ,2 


+ 777 — — In A/g — In c! 


(29) 


K 1 (Qr,Q r )= ^ /dM W ^ (s) Qr(Ai (r) )Q s (^ W )e- (1 -^ )7? -^ s: « (/i “ r) - ,/ “ S))2 , (30) 

r,s=l,2 ** 

KnriQr, Qr) = Pr f d/Z (r) QA^Qr^) , (31) 


A-IIIr (Qr; { 0 a}> {^a} ) = nn cfe,; a (Qr(£Ci)exp 


igy r a=l ieVr 


„ ^a x ia ) 


(32) 


Now we evaluate lim 7 v-s.oo(ln[Z n (/ 3 |L)]i)/A^ with the saddle-point method and calculate the second-smallest eigen¬ 
value according to Eq. m■ For this, we assume that the functional forms of Q r ( P ) and Q r (/z) are invariant under 
any permutations of replica indices a £ {1, 2,..., ro}, which is often termed the replica-symmetric ansatz. Further, 
the Gaussian nature of the current problem allows us to assume that Q r (p) and Q r (p) are mixtures of Gaussian 
functions; this originates from the fact that the effective Hamiltonian yielded by appropriate exponentiations of the 
S functions is composed of quadratic forms. These restrict the functional forms of Q r (/z) and Q r { P ) as 


Q r { P ) = T r 


Q r { P ) = T r 


dAdH q r {A 1 H) 



exp 


PA 

2 


E 



dAdH q r (A , H) exp 


n n 

2 E (E-'o + 2 H P 2j 



(33) 

(34) 


i.e., some superpositions of Gaussian functions with weights q r (A 1 H) and q r (A,H ), where A and A denote the 
variances and H and H denote the means of each Gaussian distribution, respectively. When n = 0, the order 
parameters Q r (/x) and Q r (/x) coincide with Eqs. (IA9I) and (IA10D in Appendix lAl and thus the normalization factors T r 
and T r are equal to Eqs. (IA19I) and (IA20I) . With Eqs. (1351) and (1551) . (1501) (1551) become functions of n that are extendable 
to real values of n. Inserting these expressions into the identity N -1 [In Z(/3\L)\ l = lim n _ > o(c?/3n)lV " 1 In [Z n (/3\L)] L , 
we have 


[\ 2 ] l = - extr i [dAdH [ dA'dH'Z(A, H, A', H') 

9 r,gr, 0 ,d I J J 


C P 1 P 2 Pi 


P 2 


- + E ) qi (A, H) qi (A\ + ( - + r ) 95 ( 24 , H)q 2 (A', H’) + 2 (1 - r) qi (A, H)q 2 {A ’, H’) 


Pi 


' — c ^2 Pr j dAdH / dAdH q r {A 1 H)q r (A 1 H) 

r— 1,2 d d 


(H + H) 2 H 2 


c c 

E Pr / II ( dA g dH g q r {A g ,H g 


r—1,2 u g= 1 


2 -E.A 


A-A 

2 


1 S g A g 


(35) 


where we set 


cpip 2 ’ 

(1 + A’)H 2 + (1 + A)H' 2 + 2HH' H 2 H' 2 
’ ' ~ (1 + A)(l + A') — 1 T ~ IT ‘ 


(36) 

(37) 
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In the above calculation, we have assumed the replica symmetry for (p a and ip a , i.e., <j> a = <f> and ip a = ip, respectively, 
for a = 1, 2,..., n. The saddle-point conditions yield the following set of integral equations: 


UAH) 

UAH) 

and 


dA'dH' \l + 1 ^Y)p 1 q 1 {A',H') + {l-T)p 2 q 2 (A',H') 
dA'dH' I l + ^r)p 2 q 2 (A',ff') + (l-r)p 1 q 1 (A',ff') 


S(A+ — ““T7^ s(h- H ' 


S[A + 


1 + A' 
A' 

1 + A' 


6 [ H — 


1 + A! ) ’ 
H' 


i + A' r 


c — 1 


UA H) = / H (dA g dH g q r {A g ,H g j) 5 Ih + i/j/2 - y H g ) <S l A - </> + y A g 


9=1 


9=1 


9=1 


(38) 

(39) 

(40) 


Note that we set c > 2 here. To obtain nontrivial random regular graphs, the degree c of each vertex needs to be 
greater than 2. Furthermore, the saddle-point conditions with respect to the auxiliary parameters ip and <p give 


I dAdH Qr(A, H)— = 0 


A 


^2,p r I dAdH Q r (A,H) ( ^ ) = 1, 


A 


respectively, where we have defined 


Q r (A,H)= I H (dA g dH g UAg,H g j) 8 Ih + ip/2-'*T t H g \ 6 Ia- 


9=1 


9=1 


^2 a) > 

9=1 / 


(41) 

(42) 

(43) 


which corresponds to the complete marginal of Eq. pop . The distribution of A and H in module r, Q r (A , H ), can be 
obtained by iteratively updating the saddle-point equations (1551) CU1) . while keeping the constraints (TUI) and ATI) . 

Recall that <p leads to the normalization condition JT X i/H = 1 (see Eq. (17710 . Then, Eq. ATI) should indicate the 
same restriction, as it incorporates the average over the random graphs at the same time. Hence, Eq. (1431) indicates 
that the distribution of H/A gives the distribution P r {x) = Yliev $(x — Xi)/N r of the elements of the second-smallest 
eigenvector of each module. As shown in Fig. ( 2 j the distributions obtained by iterating the saddle-point equations 
agree very well with the numerical experiments. 

To obtain an analytical expression, we further restrict the form of the solution. We assume that the variances in 
Eqs. (ESP and (EA!) have the same values a and a, i.e., q(A) = f dHq(A, H) = S(a — A) and q{A) = f dHq{A, H) = 
S(a — A). Then, Eq. (l35l) becomes 


[A 2 ] l = -extrj ^ ^ ^cpi [(1 + a)m 21 + m^] + cp 2 [(1 + a)m 22 + m\ 2 ] - 7 (mu - mi 2 ) 2 j 


a (a + 2 ) 

-- V cp r (m 2r + 2mi r rhi r + m 2r ) 

n. — n ^ 


r—1,2 


+ ■ 


y Pr 7 — ciprhi r + cm 2r + c(c- l)rh\ r j + </>?, 

r=l,2 ' ’ ) 


(44) 


where we have denoted the moments of H and H as m nr = f dHH n q r (H) and m nr = f dHH n q r (H). Equation ATI) 
has solutions with m\ r ^ 0 and m\ r = 0. For the solution with m\ r ^ 0, after taking the saddle point, we have 


1 + CL — 


1 


1 + a’ 
a = <p — (c — l)a, 


mn = 


P 2 

cpi 




c — 1 — 


(45) 

(46) 

(47) 


IU L = (1 — T) 


(48) 
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(a) 


(b) 




FIG. 2: (Color online) Distributions of the elements of the second-smallest eigenvector of the two-block random regular graphs. 
The dots in each plot represent the numerical results with N = 10 4 vertices, in which the average is over 100 samples. The 
solid lines in each plot represent the results from the saddle-point equations (1381) (1401) with parameters (a) c = 3, pi = 0.7, 
P 2 = 0.3, 7 = 0.1, and (b) c = 4, pi = 0.6, P 2 = 0.4, 7 = 0.1. 


The requirement that rhh > 0 implies that the above solution is valid for 


1 

y/c — 1 


< r, 


or, in terms of 7 , 


7 < cf(c)pip 2 , 


(/(c) = 1 - 


1 

y/c— 1 


(49) 


(50) 


The point at which the equality holds in Eq. (15(1 is the detectability threshold of the random regular graphs. Above 
this point, we have the solution with m\k = 0. When rh\k = 0, we have 


[A 2 ]i = -<p = c- 2 y/c- 1 , 


(51) 


which matches Eq. C51) at the boundary of (1501) . In both cases, we have ip = 0, which comes from the symmetry 
property in which the problem is invariant under conversion from x to — x. Equations (1481) and (1511) are plotted in 
Fig. OHa), together with the results of the numerical experiments. Again, the agreement is quite good. 

To compare our results with those reported in the literature, we recast Eq. (14911 in terms of Cj n = p- ln N and 
c 0 ut = Pout A', and set the module sizes to pi= P 2 = 0.5. First, we recast Eq. (15U1) in terms of the total degree within 
a module K m and the total degree from one module to the other K out . With these quantities, we have K out = 7 IV 
and K = cN = 2(A) n + K out ) for the total degree of the whole graph K. Then, Eq. (l50l) reads 


ATjn AT out 7 


N c 

~2 


(52) 


In the limit TV — > 00 , K- m {= cN /2 — jN) and AT out (= 7 N) are related to Cj n and c ou t as Eqs. © and ([3]), respectively. 
Therefore, the detectable region (l52l) is 


Cin C 0 ut -IT 2 . -- (53) 

VC — 1 

In the dense limit c —> 00 , Eq. (l53l) converges to the result in Q. The implication of Eq. (l53l) is that, even if the effect 
of localization is absent, the spectral method for sparse graphs never reaches the ultimate limit 2 y/c )5), 13,3• 

Although the distribution P r (x ) is not of Gaussian form, even for the case of equal size modules [see Figs. |U[a) 
and [U(b) f° r the apparent cases], it is expected to be somewhat close to the Gaussian distribution, especially when 
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(a) (b) 




FIG. 3: (Color online) Average second-smallest eigenvalues of the unnormalized Laplacian L of the random regular graphs as 
a function of (a) 7 and (b) Ci n — c ou t- The solid lines represent the analytical solutions 1 (1461) and (EH)] and the dots represent 
the numerical results. The numerical experiments used N = 10 4 vertices, and each dot represents the average over 100 samples. 


the peaks are not well-separated. Estimating the mean m(x) and variance s 2 (x) from the replica analysis above, we 
can calculate the mean value of the fraction of correctly classified vertices as [1 + erf(|m(:r)|/i/ 2 s 2 (ir))]/ 2 , under the 
Gaussian approximation (see Appendix [B] for details). Figure [5] illustrates this estimate, together with the results of 
numerical experiments and the populations obtained by iterating the saddle-point equations. The Gaussian fitting 
appears to give a fairly good approximation, especially around the detectability threshold. We can also observe some 
convergence of the numerical results to our estimate, which is for the infinite-size limit. 

Note that there exists a finite fraction of misclassified vertices, even in the limit N —> 00 . Although the poor 
performance of optimization algorithms may have crucial effects in practice, misclassification occurs in principle 
because the planted partition is not necessarily the partition that optimizes the objective function. Unless the block 
structure of the planted partition is sufficiently strong, the random graph is likely to have a partition that is better, 
in the sense of the objective function, than the planted partition. 

We now discuss the behavior of the detectability threshold (1501) . We consider 7 /c, the fraction of connections 
between modules normalized by the degree. The upper bound of this quantity is min{pi,p 2 }, which is achieved when 
all edges in a smaller module are connected to the other module, i.e., the case where the small module has a bipartite 
structure. For a uniform random graph, i.e., a graph with no block structures, the expected value of 7 /c is p\P 2 - This 
is because, for each stub or half-edge in the first module in cNi, the probability of being connected to the second 
module is N2/N. Therefore, we have 7 IV = cN\ x N2/N . This can also be obtained as the value of 7 with 7 = 0 in 
(IA15D . Note that the parameter 1 — T is the ratio of 7 to cp±p 2 - Because 0 < /(c) < 1, the detectability threshold 
7 c /c falls somewhere in the range (0,pip2) (see Fig. [ 6 J. 

Let us consider the value of 7 /c such that the total degree within a module is greater than the number of edges 
between modules. As the number of edges within a smaller module is cNmm{pi,p 2 } — 7 -/V, we have 

— < imin{pi,p 2 }. (54) 

c 2 

This corresponds to the region where the weak definition of a community [40[ is satisfied with respect to the smaller 
module. While the detectability threshold (1501) is always in the region stated in (l54l) for equal-size modules, this may 
not be the case for unequal-size modules, because the larger module may possess a strong block structure even if the 
smaller one does not. Such a situation is achieved when 2/(c) msLx{pi,p 2 } > 1. In other words, because /(5) = 1/2, 
the spectral method always loses all information about the planted solution when the weak definition is not satisfied 
in any module size for c < 5. Note that this is for bisection using the sign of the eigenvector with the second-smallest 
eigenvalue. Although this is the standard approach, as we mentioned at the end of Sec. IIIII its performance is not 
reliable in practice when the module sizes are very different. 
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(a) (b) 




FIG. 4: (Color online) Gaussian fitting of the distributions of elements for the second-smallest eigenvector of the two-block 
3-random regular graphs. The values of 7 are (a) 0.1 and (b) 0.05, and the equal module sizes are considered in both plots. 
The dots in each plot represent the results of the numerical experiments with N = 10 4 vertices, in which the average is over 
100 samples. The solid lines in each plot represent the Gaussian distributions with the same mean and variance as the values 
in the numerical experiments. 



FIG. 5: (Color online) Estimate of the fraction of correctly classified vertices in the two-block 3-random regular graphs with 
equal module sizes and the numerical results. The crosses represent the estimates using the replica method, and the solid line 
represents their Gaussian approximation. The numerical experiments were conducted for various graph sizes. 


V. DETECTABILITY THRESHOLD IN RANDOM GRAPHS WITH DEGREE FLUCTUATION: THE 

UNNORMALIZED LAPLACIAN 

We now analyze the case where the degree fluctuates in the unnormalized Laplacian L. Within N r = p r N vertices, 
we consider a graph in which b t N r (t £ {1,2,..., T}, Y^t-i = 1) vertices have degree c t . In the limit N —> 00 , the 
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7c 


= f{c)PlP2 


7 

C 


FIG. 6 : (Color online) Parameter region of the detectability threshold for random regular graphs, /(c)pip 2 , in the space of 7 /c. 
This is in the range ( 0 ,pip 2 ), whereas the region in which the weak definition of a community is satisfied is below min{pi,p 2 }/ 2 . 






W 

weak definition 

0 1 

-mm{p 1 ,p 2 } P 1 

P 2 min{j 

, 0 

h,P2i 

P 

5 


replica-symmetric solution of the second eigenvalue is given by 
[\ 2 ] l = - extr \ [ dAdH [ dA'dH' E(A, H, A', H') 


X 2 ( ( cpi ~ 7 ) 91 ( 2 ! ,H)q 1 (A',H') + (cp 2 - 7 ) 92 (A H)q 2 {A ', H') + 2751 (A, H)q 2 (A', H') 


' —c 55 p r / dAdH / dAdH q r {A 1 H)q r (A, H) 

r=1,2 ^ 


(H + H) 2 H 2 
A-A A~ 


+ 


/* C-t 

Pr bt / n ( dA g dH g q r (A g , iT s 


s=l 


2 - 




’ ^9 ^9 


The saddle-point equations derived from (1551) are 

qi(A,H)= [ dA'dH' \(l-^L] qi (A',H') + ^q 2 {A',H')\s(A- 

J L\ c piJ c vi J V 

fo{A,H)= [dA'dH' \(l-^L) q 2 (A',H') + ^ qi {A',H')}s(A 
J [\ cp 2 J cp 2 J V 


A! 


1 + A' 
A' 

1 + A' 


6 yH — 
S\H- 


H’ 

TTa 7 

H' 

1 + A' 


and 


q r 


(A,H) = Y J hj ^ j LI ( dAgdHMAg , H a )) 8 ( H + | - £ H t 


. C t — 1 




Ct — 1 


9=1 


9=1 


/ ct — 1 N 

<5 I A — (j) + 55 A g 

\ 9=1 J 


(55) 

(56) 

(57) 

(58) 


Compared with the saddle-point equations for the random regular graphs, we now have a step in (1551) to draw a degree 
Ct from the excess degree distribution b t ct/c at every iteration. 

Note that, in the case of the random regular graphs, we had a set of solutions q r (A,H) and q r (A,H) of the 
saddle-point equations m m, with A and A fixed at certain values a and a. Although these give exact solutions 
with random regular graphs, functions of this form cannot solve Eqs. (|56l) (l58l) because of the degree fluctuation. 
Although the saddle-point equations (1561) (1551) no longer have solutions of simple form, it is still important to obtain 
an analytical expression. For this purpose, we again employ q r {A) and q r (A) of the form q r (A) = S(a — A) and 
q r {a — A), respectively. This is called the effective medium approximation (EMA) [39[, |4l|. We can then recast 

Eq. (1551) as 

[A 2 ]^ = —extr j</>+ ^ ^ ^c55?+ [(! + a)m 2r + m\ r ] - 7(mn - mi 2 ) 2 j 

— c ^ r (rri2r + 2mi r mi r + m2r) 


a — a 




eta 


ip 2 


+ C t {m 2 r - ipmir) + c t {c t - 1 )rh\ r 


(59) 
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Introducing 


Rn = J2 

t 


i>«=f 

(t>-c t a 

b t c ? 

{(j) - c t a ) 2 ’ 


and provided that a and (f> are obtained at the saddle point, we have 

Xi — 5 2 


TO n = 


(5 2 - 5 1 )X 1 X 2 + (5r5 3 - 5 2 )A 2 - 5rA 3 ’ 


Pi 


[A 2 ] l = - 


V 2 


— _ i? 2 - ^1 p2 


cl{q T 2) (Id - &) 


(60) 

(61) 


(62) 

(63) 


where we set 


We have mn = 0 when 5 2 > X\, i.e., 


r = 


Ah 


Cpip 2 ’ 

~9-(a 2 + 2 a + 2 ), 
c 


X 2 


Pi 

P2 



X 3 


2 RIpi r 

c p 2 1 + a 


(l + a) 2 <^-l. 


(64) 

(65) 

( 66 ) 

(67) 

( 68 ) 


Equation (1651) gives the formal solution for the second-smallest eigenvalue, and the equality condition in (1551) is our 
estimate of the detectability threshold with the EMA. 

When the graph has a bimodal distribution, i.e., {b i, & 2 } for Ci and c 2 , we can solve for a and </> at the saddle point 
analytically. In this case, the saddle-point conditions give 


CiC 2 a (1 + a 

1 CL 


+ 1 - c 


c 2 - c 1 + 


1 -)- CL 


(69) 


where we have defined c 2 


b\c\ + 5 2 c|, and a is the solution of the following quadratic equation: 


c 


1 A CL 


2 


+ 



ClC 2 \ 

r-17 


1 A Ci 

r 


CiC2(l - C) = p 

r-1 


(70) 


We take the smaller value for the solution of Eq. ( 7 QD, which gives a nonnegative value for to 2 ± . 

Figures [71(a) and [71(b) show the eigenvalues of the unnormalized Laplacian with the EMA and those of the localized 
eigenvectors, together with the results of the numerical experiments. The estimate of the localized eigenvectors 
and their eigenvalues is discussed in Sec. ED and we describe the resulting behavior here. As 7 increases, the 
eigenvector possessing information about the modules will eventually have a higher eigenvalue than that of the 
localized eigenvector; the standard spectral method fails in such a region. Our estimate agrees very well with the 
numerical result, as long as localization does not occur. Although we do not know which g should be chosen a priori 
(see Sec. IVIl for the meaning of g ), if we choose a value that is consistent with the result of the numerical experiment, 
our estimate of the localization transition is close to the point where the result given by the replica method with the 
EMA starts to deviate from numerical result. 

Eigenvector localization is expected to result from the existence of a few vertices with irregular degrees, which we 
call the defects. In Figs. (71(a) and[71b), we regard the vertices with the lower population as the defects. As we show in 
Secs. IVII and IVIlIl localization tends to occur in both unnormalized and normalized Laplacians when the defects have 
a lower degree. Therefore, when we have equal populations, i.e., 61 = 6 2 = 0.5, we regard vertices with lower degree 
as defects. Indeed, when vertices with lower degree are dominant, localization does not seem to occur, or produces 
only a negligible effect. 
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FIG. 7: (Color online) Average second-smallest eigenvalues of the unnormalized Laplacian L of the two-block random graphs 
with bimodal degree distributions, as a function of 7 . The degree sets are (a) { 01 , 02 } = {3,6} and (b) { 01 , 02 } = {3,9}. The 
ratio of module sizes is set to pi = 0.6 (p 2 = 0.4) in both cases. The estimates given by the replica method with the EMA 
and the estimated eigenvalues for the localized eigenvectors are represented by the solid lines (see Sec. eh for details of the 
localized eigenvectors). When the latter gives a lower eigenvalue, the former is indicated by a dashed line. The dots represent 
the numerical results with N = 10 4 . The average is over ten samples. In each plot, results are plotted, from top to bottom, for 
bi = 0.1, 0.5, 0.9 (& 2 = 0.9, 0.5, 0.1). 



FIG. 8 : (Color online) A tree with the defects aggregated around its root. 


VI. LOCALIZED EIGENVECTOR OF THE UNNORMALIZED LAPLACIAN WITH BIMODAL 

DEGREE DISTRIBUTIONS 

The behavior of the eigenvalues of localized eigenvectors for the unnormalized Laplacian L was described in Sec. El 
This section is devoted to their derivation and an analysis of some specific examples. As mentioned in the previous 
section, localized eigenvectors emerge because of degree fluctuations. They are the vectors in which the weight of 
their elements is concentrated around a few defects, the vertices with characteristic degrees. Here, we analyze the 
localization of an eigenvector for the unnormalized Laplacian L and consider this process for the normalized Laplacian 
C in Sec. IVIIII We focus on sparse graphs with bimodal degree distributions. The graphs have two types of degree, 
cd and eg, with populations 6b and bg (6b + bg = 1), respectively. We let bg < bg and refer to the vertices with 
degree cb as the defects. 

As frequently analyzed for sparse graphs in the literature [H, l~iTl [ril l (see Fig. [8]) , we consider a tree with defects 
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aggregated around its root, up to a distance g from the root. Hereafter, we denote a localized eigenvector as v. We 
now simplify the graph by letting all vertices at distance d > g have a uniform degree eg• Then, the symmetry of the 
graph implies that, for all vertices located at distance d from the root, m = V d . From the definition of the eigenvector 
Lv = Xv , for g > 1, we have 

(eg — A)Vo — eg Vi = 0, 

(cd — l)Vd+i - (cd — A )Vd + Vd- 1 =0 (0 < d < g), 

(c B -l)V d+1 -{cB-X)V d + V d -i=0 (d>g+ 1). (71) 

To be a localized eigenvector, the element V d needs to vanish at d —> oo. Thus, we choose a solution of the form 
Vd+i = nV d with |re| < 1 for d > g, where k satisfies 

(cb ~ 1)« 2 — (eg — X)k + 1 = 0. (72) 


In addition, we have the constraint that the norm of the eigenvector needs to be finite, i.e., \v\ 2 < oo. Let us consider 
the case g = 0, i.e., only the vertex at the root is defective. From Eq. CD. A must satisfy 


cd — A 

CD 


Vi 

Vo 


V2 

Vi 


k(A), 


(73) 


where re(A) is a function of A determined by Eq. (1771) . The resulting non-zero eigenvalue is A = cd(cd — Cd — 2)/(cg — 
cd — 1), and the corresponding damping factor k is k = l/(cg — Cd 1)- In addition, the constraint finite norm 
requires cd < eg — 1 — s/cb — 1. Interestingly, the defect must have a lower degree than the other vertices, whereas, 
in the case of the adjacency matrix, it was hubs that caused localization. Note, however, that this is the condition 
for a tree with uniform degree at d > 0 and does not hold in general. 

Let us now consider using the results given by the replica method with the EMA from the previous section in the 
case where the vertex degrees at d > g are not necessarily uniform. Recall that [A 2 ]g = —(j) when m\\ = 0, and that 
a satisfies Eqs. (l45l) and (l46l) for random regular graphs. From these relations, we see that the factor 1 + a obeys the 
same characteristic equation as CZ2D- This connection can also be seen from the saddle-point equation; Eqs. m ® 
yield dH/dH g = (1 + a) -1 = 1 + a. This implies that, in tree-like graphs, the response function dVi/dHj generally 
damps as 0(( 1 + a) D ^ ,J l), where D(i,j) is the distance between sites i and j. In particular, taking vertex j to the 
root (g = 0) and comparing the relation with the solution form V d = const n d for d > g, where const is proportional 
to Vo, we conclude n = 1 + a. Hence, one way to estimate the localized eigenvector and its eigenvalue is to replace 
the damping factor k with 1 + a*(A), where a*(A) is the value of a obtained via the EMA as a function of A. Then, 
analogously to the example above, we numerically compute the eigenvalue A so that it is consistent with the equations 
for d < g in Eq. CD, i.e., 


^r i = ^ = 1 + a*(A). (74) 

Vg Vg+1 

If this eigenvalue is smaller than the second-smallest eigenvalue calculated in the previous section, we can conclude 
that the localization transition has occurred. 


VII. DETECTABILITY THRESHOLD IN RANDOM GRAPHS WITH DEGREE FLUCTUATION: THE 

NORMALIZED LAPLACIAN 

We now analyze the spectral method with the normalized Laplacian C and consider the two-block random graph 
with degree fluctuations. With y := D~ 1 ^ 2 x, the Hamiltonian can be written as 

H(x\C) = ^x t £x = ^y T Ly, (75) 

and, noting that the total degree reads K = cN , we define the partition function in terms of y as 

Z{p\C)= J dye-P H{ylL) 6(y T Dy-cN)5(l T Dy), (76) 

where we have omitted the constant factor obtained by defining the partition function in terms of x. The free energy 
defined by this partition function is related to the second-smallest eigenvalue according to 

2 lim [mC)\c = - 2 i im Tro M/31 C)] c = c [A 2 ] £ . 

/3—>oo /3->oo JV p 


( 77 ) 
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Note that the vector y is not the eigenvector of C , and therefore the distribution P(y) does not give the distribution 
of the second-smallest eigenvector. However, as the sign of each vector element is not changed by the conversion from 
y to x, the fraction of correctly classified vertices is readily obtained from the distribution P(y). 

In the limit TV — > oo (as in Secs. m and 0 , the replica-symmetric solution can be obtained as 


c [A 2 ] £ = — extr i dAdH dA'dH'E(A,H, A', H') 

q r ,q r ,0,ip I J J 


X 2 y^P 1 ~ l)<p{ A i H )qi{A ,H') + ( cp 2 - j)q 2 (A, H)q 2 (A', H r ) + 2jqi(A, H)q 2 (A', H') 

+ c<fi — c p r f dAdH f dAdH q r (A, H)q r (A, H) | + J -— J 

r—1,2 J J \ A ~ A J 


+ 


J2 pr /n( dA g dH g q r (A g , H\ 


r—1,2 t 


9= 1 


a (a^/2-E ,»,) 

' c t(t> ~ Eg A g 


(78) 


We have the same saddle-point equations as (1551) and (1571) for q r (A , H). For q r (A, H ), we have an analogous equation 
to (1581) . but with ip and <p replaced with Ctip and Ct<p, i.e., 


q r (A,H) = J2 b -Y- I II (dA g dH g q r (A g ,H g ))s^H + ^-J2H 9 ) S ( A -ctcP+J2 A : 


, c t — l 


c t ip 


ct — 1 


Ct — l 


(79) 


t g=l 

With the EMA, Eq. (1751) is approximated as 


< 7=1 


s=i 


[A 2 } c = -extri (p+ 1 ( Y ^p k [(1 + a)m 2k + m\ k 1 - -(ran - toi 2 ) 2 ) 

l a ( a + ^\ k C / 


a — a 


y ^Pk ( rn 2k + 2m lk m lk + m 2k ) 


1 


c(cp — a) 


— f■ + Pk (rh 2k - ciprhik + (c - l)m 2 fc ) 


fc=l,2 


From the saddle-point conditions, for the solution with m\ k ^ 0, we have 

1 T h = 


1 + a ’ 

a = C(p— (c — l)a, 

2 /- i \2 7*2 j . 

11 = ( ; Si 1 


i - r / i 
[A 2 ] £ = — 0= —fc-l- = 


(1- 1 —^) 

((c- i)r 2 -1) 

V (c- i) 2 r / 



and in the region where m\ k = 0, we have 


[A2] c = -<j> = 


(v^t-i y 


Again, ip = 0 in both cases. The condition for the solution with m\ k ^ 0 to exist is 

1 


\Jc — 1 


< r, 


(80) 

(81) 

(82) 

(83) 

(84) 

(85) 

( 86 ) 


and the detectability threshold is the condition when the equality holds. 

Figures Oa) and|HKb) plot Eqs. (1571) and (1551) . together with the results of the numerical experiments for bimodal 
distributions. These figures are plotted in the same way as in Figs. [71(a) and [71(b). Again, our estimate gives a fairly 
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FIG. 9: (Color online) Average second-smallest eigenvalues of the normalized Laplacian C of two-block random graphs with 
bimodal degree distributions as functions of 7 . The degree sets are (a) { 01 , 02 } = {3,6} and (b) { 01 , 02 } = {3,9}. The ratio 
of module sizes is set to p\ = 0.6 (p 2 = 0.4) in both cases. The estimates given by the replica method with the EMA and 
the estimated eigenvalues of the localized eigenvectors are represented by solid lines (see Sec. IVIlH for details of the localized 
eigenvector). When the latter gives a lower eigenvalue, the former is indicated by a dashed line. The dots represent the 
numerical results with N = 10 4 . The average is over 10 samples. In each plot, results are plotted, from top to bottom, for 
61 = 0.1,0.5, 0.9 (b 2 = 0.9, 0.5, 0.1). 


accurate prediction of the numerical results ( 44 }. As conjectured in Sec. IVIIll a comparison of Figs. [7}a) and [Tib) 
and Figs. [HI a) andj^b) shows that localization is less likely to occur with the normalized Laplacian C than with the 
unnormalized Laplacian L. 

Figures [Tol a) andQjJb) show the fraction of correctly classified vertices, ignoring the effect of localization. Although 
our estimates are close to the numerical results when the effect of localization is negligible, they differ significantly in 
the region where localization is present. Note that the difference between the crosses and the solid line in each figure 
is not totally due to the Gaussian approximation of the distribution of the eigenvector elements (see Appendix [C] for 
details). In this case, the Gaussian fitting (solid line) whose mean and variance are estimated by the EMA of the free 
energy seems to be a better approximation. 


VIII. LOCALIZED EIGENVECTOR OF THE NORMALIZED LAPLACIAN WITH BIMODAL DEGREE 

DISTRIBUTIONS 

In this section, we analyze the localized eigenvectors of the normalized Laplacian C. The behavior of the eigenvalues 
of these localized eigenvectors for the normalized Laplacian C were described in Sec. IVII1 and this section is devoted 
to their derivation and an analysis of some specific examples. 

We consider the tree in which the defects are aggregated around its root up to a distance of g. The analysis here is 
completely analogous to that for the unnormalized Laplacian L in Sec. IVII The equations corresponding to (17T1) for 
g > 1 are 

Vi - (1 - A) V 0 = 0, 

(CBC 2 )- 1/2 (C 2 ^ 1)V„+1 - (1 - A )Vg + C^Vg-! = 0, 
c- b \c b - l)V g+ 2 - (1 - A)V g+ i + ( c B c 2 )- 1/2 V g = 0, 

Cb\c b - l)V d+ i - (1 - A)V d + c^Vd-! = 0 (d>g + 2). (87) 

As in Sec. ED we consider the case where the vertices at d > g have a uniform degree c B . Interestingly, when <7 = 0 , 
an analogous calculation as for the unnormalized Laplacian L in Sec. IVII yields that the eigenvector with a non-zero 
eigenvalue of the form V^+i = nVd f° r d > 1 has k = — 1, i.e., we never have a localized eigenvector. Moreover, when 
7 = 1, the condition 2 c B < c B must be satisfied to give a localized eigenvector with a finite norm; again, the defects 
need to have a lower degree. Note also that the results obtained here are more severe than the conditions in Sec. IVII 
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(a) (b) 



FIG. 10: (Color online) Fraction of correctly classified vertices with the spectral method for two-block random graphs with 
bimodal degree distributions as functions of 7 . The normalized Laplacian £ is used, and the degree sets are (a) {ci, C2} = {3, 6 } 
and (b) {01,02} = {3,9}. The population of each degree and the ratio of module sizes are b 1 = 0.5 (62 = 0.5) and p\ = 0.5 
{jp 2 = 0.5) in both cases. The dots represent the numerical results with various graph sizes. The crosses represent the results 
obtained by iterating the saddle-point equations (15(11) . (I571i . and (EH) . The solid lines represent the results obtained by fitting 
the distribution of eigenvector elements to a Gaussian distribution whose mean and variance are estimated by the saddle point 
of the free energy with the EMA (see Appendix [B] for details). 


for the unnormalized Laplacian L in the sparse case. This implies that localization tends to be suppressed in the 
normalized Laplacian C. In fact, this tendency is consistent with the analysis in data clustering [38l. EBI]. 

In the case where the vertex degrees at d > g are not necessarily uniform, a localized eigenvector can be estimated in 
almost the same way as for the unnormalized Laplacian L in Sec. lVIl utilizing the results of the replica method with the 
EMA in Sec. IVin Note that, unlike the case of the unnormalized Laplacian L , the value of Cb is needed to determine the 
ratio V g+2 /V g+l in Eq. (l87l) . Thus, when we solve for the consistent eigenvalue by V g+ 2 /Vg+i = V g+ z/V g+ 2 = l+a*(A), 
we replace cb in the ratio V g + 2 /V g +i with the average degree c. 


IX. DETECTABILITY THRESHOLD OF THE STOCHASTIC BLOCK MODEL WITH THE 

NORMALIZED LAPLACIAN 

Finally, we consider the eigenvalues and detectability threshold in the stochastic block model, i.e., the random 
graph with a Poisson degree distribution. To compare with the literature, we recast our result in terms of Ci n and 
c ou t and set the module sizes to pi = p 2 = 0.5. With these quantities, the detectability threshold in the normalized 
Laplacian, Eq. (I5S1) . reads 

2 c 

Cin ^ Cout = —==■ (88) 

\/c — 1 

Compared with the threshold obtained in Q, we have a correction factor of — 1. The phase diagrams of 

these two thresholds are shown in Fig. [TlJ While the difference between them is negligible when the average degree 
is sufficiently large, considering the fact that the upper bound of the parameter Cj n — c ou t is 2c, this gap is indeed 
considerable in sparse graphs. 

We can compare the average estimate of the second-smallest eigenvalue [A 2 ] £ with the numerical results. In terms 
of c; n and c ou t, Eq. (1M1) reads 

[A 2 ] £ = 1 - (Cin - Cout) - --. (89) 

2c Cin - Cout 

Figures EH a) and [T3l' a) show that the estimated eigenvalues with the EMA agree excellently with the numerical results, 
as long as the localized eigenstate does not occupy the second-smallest eigenvalue. We measured the localization 
strength with the inverse participation ratio (IPR), defined as YliLi x t /(Si=i x i ) 2 f° r a vector x , and have plotted 
this in Figs. rnil' b) and lTSl b). The IPR grows rapidly below the point at which the estimates start to deviate from the 
numerical results in Figs. irUT a) and H3Ta). Similarly, as shown in Figs. CEUt) and fldl c). our estimates for the fraction 
of correctly classified vertices start to deviate from the numerical results at that point. 
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undetectable 


8 


10 


FIG. 11: (Color online) Phase diagram of the detectable and undetectable regions given by the replica method with the EMA 
(solid line) and the ultimate threshold um (dashed line). The model cannot take parameter values with Ci n — c ou t > 2c 
(invalid region) because of the condition that Cout > 0. 


For the stochastic block model with c = 6, the localized eigenvector appears in the region significantly above the 
detectability threshold, i.e., in the detectable region, and its effect is not negligible. However, for the stochastic block 
model with c = 8, although the estimate with the EMA is still not precise, the error due to localization seems to be 
much smaller for the graph sizes we tested. Note that as the average degree c increases the degree fluctuation of each 
vertex decreases because of the law of large numbers. Therefore, the effect of localization is expected to eventually 
disappear. That is, the point at which localization occurs will finally become buried in the undetectable region. 

As mentioned in Sec. IVIII the solid lines and crosses behave differently in Figs. fT2l c) and [Idl e). Moreover, the 
results of the saddle-point equation with the EMA seem to converge to the result in [8( around the detectability 
threshold. It should be noted, however, that this coincidence is due to the property of the Poisson degree distribution 
in the stochastic block model. As described in Appendix 0 the difference between these results with the EMA 
generally depends on the form of the degree distribution. 


X. SUMMARY 

In summary, we have analyzed the limitations of the spectral method for graph partitioning, known as the de¬ 
tectability threshold, and the localization of eigenvectors. We derived estimates for the detectability thresholds of 
the spectral method with the unnormalized Laplacian L [Eq. (1551) ] and the normalized Laplacian C [Eq. ([86]) ] for 
sparse graphs. The detectability threshold with the normalized Laplacian C can generally be written as (1551) (for 
equal size modules), which is analogous to the threshold for random regular graphs (1551) . This converges to the result 
in Q in the dense limit c -A oo. For the condition where a localized eigenvector emerges, although it is difficult 
for a graph with an arbitrary degree distribution, our estimates give a fairly good prediction for two-block random 
graphs with bimodal degree distributions. Overall, our estimates with the replica method agree with the numerical 
results quite well, as long as localization is absent, for the graph sizes we tested. It should be noted, however, that 
the localization of eigenvectors is expected to be sensitive to rare events such as the emergence of vertices of irregular 
degree. Therefore, when the support of the degree distribution is infinite, e.g., a Poisson distribution, we must be 
careful as the finite size effect may not be negligible. 

We revealed that the spectral method with Laplacians does not detect modules all the way down to the ultimate 
detectability threshold 2\/c in any sparse graph. In fact, the estimated gap between the precise detectability threshold 
and the ultimate one is already considerable for very sparse graphs even in the case where eigenvector localization is 
absent or negligible (see Fig. m- Another way of viewing this result is that the method with the non-backtracking 
matrix closed the gap of detectability in two ways. Furthermore, for the graph sizes we tested, the effect of localization 
was relatively weak in the stochastic block model when the average degree was not very low. Finally, we comment that 
we must be careful when we compare the performance between the normalized Laplacian and modularity. Although 
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FIG. 12: (Color online) (a) Average second-smallest eigenvalue of the method with the normalized Laplacian C, (b) the IPR 
of its eigenvector, and (c) the fraction of correctly classified vertices with the spectral method in the stochastic block model. 
We set the average degree c = 6. In each plot, the dots represent the numerical results with various graph sizes. In (a), the 
estimate of the eigenvalue with the EMA is represented by a solid line. The dotted line shows the estimated detectability 
threshold and the dashed line shows the estimate in (§jj]. In (c), the crosses represent the fraction of correctly classified vertices 
with the EMA and the solid line represents their Gaussian approximation (see Appendix [B] for details). The dashed line is, 
again, the estimate in Q. 


their spectral methods become equivalent for a certain choice of normalization E3. there is no guarantee that our 
results precisely coincide with the detectability threshold of the modularity matrix, i.e., the method with the spherical 
normalization [461 ]. 
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Appendix A: Number of random graphs with two modules 

In this section, we calculate Afc, the number of possible graph realizations. Although the value of Afc does not 
appear in the final result in the main text, the technique used here is essential for calculating the moment of the 
partition function. Let us first consider the two-block c-random regular graphs. We sum all the connection patterns 
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(b) 
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*-in c out 



FIG. 13: (Color online) Same plots as in Fig. 1121 but with the average degree c = 8. In this case, the effect of localization is 
weaker, and the estimate with the EMA is more precise. 


({tty}, {wij}) that satisfy the constraint of constant degree. That is, 

Afo= II 6 \ H Uil + H Wik ~ c ) II 5 \ H u 3 l + w i k ~ c ) 6 \ J 2 Wik ~ ] • (Al) 
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Setting the contours with respect to z, and 2 , to be small, we can approximate the last factors as 
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where we have neglected the diagonal term in (IA7I) because N » 1. Introducing the order parameters 
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Eq. (IA12I) becomes 


M- a = pmN 2 /‘Y® 
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(A14) 

In the limit TV —> 00 , the saddle point of the integrand gives A/c- The saddle-point conditions yield 



7 = Me~ v pip 2 qiq 2 , 

(A15) 


A^pigi + Np 2 e~ v q 2 - ft. = 0, 

(A16) 


A r P 2?2 + Mpie~ v qi - q 2 = 0 , 

(A17) 


<Mi = <? 2<?2 = c. 

(A18) 

We then have 
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Inserting the values at the saddle point, we obtain the number of graphs for N 1: 


Mg — exp 
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(A21) 


A completely analogous result holds for the number of possible two-block random graphs with a given degree 
sequence, i.e., the two-block configuration model or the degree-corrected ensembles with “hard” constraints [34}. We 
let {c t }f = i be the sequence of degrees, each of which has probability {b t }[ =1 -, i.e., the number of nodes with degree c t 
is Nb t . The number of graphs Mg can then be written as 
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where, as in the main text, we denote the set of vertices in module r with degree c t as V( r ,t)- A similar calculation to 
the single-degree case yields 
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where, as in the main text, c = &tCt is the average degree. The saddle-point conditions yield analogous results, 


7 = Afpigip 2 g 2 e _?? , 

(A24) 

-/Vpi<7i + Np 2 q 2 e~ v = qi, 

(A25) 

Npiqie~ v + iVp 2 gr 2 = <? 2 , 

(A26) 

?l<7l = 9252 = c, 

(A27) 






















24 


and 


<7i = 


92 = 


>cp 1-7 


7Vp2 


2 > 


1 CP 2 - 7 




Finally, we have 
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(A30) 


Appendix B: Gaussian approximation of the fraction of correctly classified vertices 


We consider the distribution of elements of the second-smallest eigenvector belonging to the rth module, averaged 
over the realization of the unnormalized Laplacian L. This is defined as 


P r (X) = — ^ [^O - *i)] L , 


iev r 


and, as mentioned in the main text, it can be expressed in terms of H and A as 


P r ( x) = j dAdH Q r (A,H) 6 . 


(Bl) 


(B2) 


For the Gaussian fitting of the distribution P r ( x ), we solve for the mean and variance in (IB2I) . To obtain an analytical 
expression, we fix the distribution of A, i.e., Q r (A) = J dHQ r (A, H) = d(< 2 f u n — A). Note that af u n here is different 
from a in the saddle-point equations. From the cavity interpretation of Eqs. Q r (A, H) can be regarded as 

the complete marginal distribution corresponding to q r (A,H). Therefore, instead of (l46l) . Of u n can be determined as 


^full — 0 

= (c- i)r- 


(c-l)F’ 


(B3) 


where we have inserted the values of <fi and a at the saddle point. Then, in the detectable region, the mean can be 
approximated as 
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where we have used the fact that i[> = 0 at the saddle point. 
Similarly, for the second moment, we have 
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Then, the variance reads 


s 2 r (x) = (x 2 ) r -(x 2 ) r 
(c-i)T 


(c- i) 2 r 2 -1 
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(m 2r - rhlr) ■ 


(B6) 


With m r {x) and s r (x), 1 + erf(TO r (x)/i/2s 2 (x)) /2 gives the fraction of correctly classified vertices in module r. For 
the equal-size modules, |mi(a;)| = \m 2 (x )\(=: |m(a;)|) and s 2 (:r) = s|(x)(=: s 2 (i)) by symmetry. The total fraction 
of correctly classified vertices is then 1 + ev{(\m(x)\/\J2s 2 (*)) A 

For the normalized Laplacian C with the EMA, the analogous calculation for the detectable region gives 
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where c n = J2t ^t c t ■ 


Appendix C: Approximations of the saddle-point equations and the free energy 

As shown in Figs. E^a), HUb), HHc), and[T3][c), the results obtained by the EMA of the saddle-point equation 
(crosses in the figures) and the Gaussian fitting of the distribution of the eigenvector elements (solid lines in the 
figures), whose mean and variance are estimated by the EMA of the free energy, are different. One may expect that 
the former is simply more accurate, as long as the stationary state is achieved by a sufficient number of iterations, 
because the latter contains a Gaussian approximation. However, this is not correct. In fact, the EMA of the saddle- 
point equations is not equivalent to the EMA of the free energy; in the latter, the approximation is applied before 
taking the saddle point. For example, the relation between the first moment of H, m\ r , and its conjugate mi r , derived 
by the saddle-point equation d79l) with the EMA, is 


mir = 



min 


(Cl) 


whereas we have mi r = (c — l)rhi r from the saddle point of the free energy with the EMA. Hence, the difference 
depends on the ratio of the mean and the variance, or the Fano factor. In the case of the Poisson degree distribution, 
Eq. (IC1I) reads m\ r = cmi r , which corresponds to the dense limit of the saddle point for the free energy. As neither 
quantity is exact, it is not readily obvious which offers the better estimate in general. 
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